In the Specification:

Please delete the paragraph beginning at page 5, line 21 of the specification. 

Please delete the paragraph beginning at page 5, line 23 of the specification. 
Please delete the paragraph beginning at page 5, line 25 of the specification.

Please amend the paragraph beginning at page 5, line 28 of the specification to read as follows:

--Figure 6 shows an exemplary descriptor.--



Please amend the paragraph beginning at page 5, line 29 of the specification to read as follows:

--Figure 7 shows a set of e-value scores for ferritin.--

Please delete the paragraph beginning at page 6, line 1 of the specification. 
Please delete the paragraph beginning at page 6, line 3 of the specification. 

Please amend the paragraph beginning at page 6, line 5 of the specification to read as follows:

--Figure 8 shows a representative lookup table used in Q-compare or
CompareOverWins.--



Please amend the paragraph beginning at page 6, line 7 of the specification to read as follows:

--Figure 9 shows a representative block diagram of a program called RevComp.--

Please amend the paragraph beginning at page 6, line 8 of the specification to read as follows:

--Figure 10 shows a representative flow chart showing preferred steps of a preferred
database search strategy for ortholog finding.--

Please delete the paragraph beginning at page 6, line 10 of the specification.

Please delete the paragraph beginning at page 6, line 12 of the specification.

Please amend the paragraph beginning at page 6, line 14 of the specification to read as follows:

--Figure 11 shows a representative flow scheme showing preferred steps for a preferred
SEALS strategy.--

Please delete the paragraph beginning at page 6, line 16 of the specification.

Please amend the paragraph beginning at page 6, line 18 of the specification to read as follows:

--Figure 12 represents a genetic map showing a conserved iron response element in the 5'
UTR of mouse and human ferritin.--

Please delete the paragraph beginning at page 6, line 20 of the specification.
Please delete the paragraph beginning at page 6, line 22 of the specification.
Please delete the paragraph beginning at page 6, line 24 of the specification.
Please delete the paragraph beginning at page 6, line 25 of the specification.
Please amend the paragraph beginning at page 6, line 26 of the specification to read as follows:

--Figure 13 shows a representative flow scheme showing preferred steps for a preferred
Structure Predictor strategy.--

Please delete the paragraph beginning at page 6, line 28 of the specification.

Please delete the paragraph beginning at page 6, line 30 of the specification.

Please amend the paragraph beginning at page 7, line 1 of the specification to read as follows:

--Figure 14 shows a representative structure drawing of ferritin 5'UTR.--

Please delete the paragraph beginning at page 7, line 2 of the specification.
Please delete the paragraph beginning at page 7, line 3 of the specification.
Please delete the paragraph beginning at page 7, line 5 of the specification.
Please delete the paragraph beginning at page 7, line 6 of the specification.
Please delete the paragraph beginning at page 7, line 7 of the specification.
Please delete the paragraph beginning at page 7, line 9 of the specification.
Please delete the paragraph beginning at page 7, line 11 of the specification.
Please delete the paragraph beginning at page 7, line 12 of the specification.

Please delete the paragraph beginning at page 7, line 14 of the specification.

Please delete the paragraph beginning at page 7, line 15 of the specification.

Please delete the paragraph beginning at page 7, line 16 of the specification.

Please delete the paragraph beginning at page 7, line 18 of the specification.

Please delete the paragraph beginning at page 7, line 20 of the specification.

Please delete the paragraph beginning at page 7, line 22 of the specification.

Please delete the paragraph beginning at page 7, line 23 of the specification.

Please delete the paragraph beginning at page 7, line 25 of the specification.

Please delete the paragraph beginning at page 7, line 27 of the specification.

Please delete the paragraph beginning at page 7, line 28 of the specification.

Please delete the paragraph beginning at page 7, line 30 of the specification.



Please delete page 8. 

Please delete the paragraph beginning at page 9, line 1 of the specification. 
Please delete the paragraph beginning at page 9, line 3 of the specification.

Please delete the paragraph beginning at page 9, line 5 of the specification. 

Please amend the paragraph beginning at page 9, line 7 of the specification to read as
follows:

--Figure 15 shows a representative mass-spec structure probe analysis of region 1 of
ornithine decarboxylase 3'UTR.--

Please delete the paragraph beginning at page 9, line 9 of the specification. 


Please delete the paragraph beginning at page 9, line 11 of the specification.
Please delete the paragraph beginning at page 9, line 13 of the specification. 
Please delete the paragraph beginning at page 9, line 15 of the specification. 
Please delete the paragraph beginning at page 9, line 17 of the specification. 
Please delete the paragraph beginning at page 9, line 19 of the specification. 
Please delete the paragraph beginning at page 9, line 21 of the specification. 
Please delete the paragraph beginning at page 9, line 23 of the specification. 
Please delete the paragraph beginning at page 9, line 25 of the specification. 
Please delete the paragraph beginning at page 9, line 27 of the specification. 



DOCKET NO.: IBIS-0012 

Please delete the paragraph beginning at page 9, line 29 of the specification. 

Please delete the paragraph beginning at page 9, line 30 of the specification. 
Please delete the paragraph beginning at page 10, line 1 of the specification. 
Please delete the paragraph beginning at page 10, line 3 of the specification. 

Please delete the paragraph beginning at page 10, line 5 of the specification. 


Please delete the paragraph beginning at page 10, line 6 of the specification. 
Please delete the paragraph beginning at page 10, line 8 of the specification. 
Please delete the paragraph beginning at page 10, line 10 of the specification. 
Please delete the paragraph beginning at page 10, line 11 of the specification.
Please delete the paragraph beginning at page 10, line 12 of the specification. 
Please delete the paragraph beginning at page 10, line 14 of the specification. 
Please delete the paragraph beginning at page 10, line 16 of the specification. 
Please delete the paragraph beginning at page 10, line 17 of the specification. 
Please delete the paragraph beginning at page 10, line 18 of the specification. 
Please delete the paragraph beginning at page 10, line 19 of the specification.

Please delete the paragraph beginning at page 10, line 20 of the specification. 

Please delete the paragraph beginning at page 10, line 21 of the specification. 

Please delete the paragraph beginning at page 10, line 22 of the specification. 

Please delete the paragraph beginning at page 10, line 23 of the specification. 

Please delete the paragraph beginning at page 10, line 24 of the specification. 

Please delete the paragraph beginning at page 10, line 25 of the specification. 

Please amend the paragraph beginning at page 16, line 11 of the specification to read as
follows:

--Additional nucleic acid targets may be determined independently or can be selected from
publicly available prokaryotic and eukaryotic genetic databases known to those skilled in the art.
Preferred databases include, for example, Online Mendelian Inheritance in Man (OMIM), the Cancer
Genome Anatomy Project (CGAP), GenBank, EMBL, PIR, SWISS-PROT, and the like. OMIM,
which is a database of genetic mutations associated with disease, was developed, in part, for the
National Center for Biotechnology Information (NCBI). OMIM is publicly available through the
Internet at the world wide web at, for example, ncbi.nlm.nih.gov/Omim/. CGAP is an
interdisciplinary program to establish the information and technological tools required to decipher
the molecular anatomy of a cancer cell. CGAP is publicly available through the Internet at the world
wide web at, for example, ncbi.nlm.nih.gov/ncicgap/. Some of these databases may contain complete
or partial nucleotide sequences. In addition, nucleic acid targets can also be selected from private
genetic databases. Alternatively, nucleic acid targets can be selected from available publications or
can be determined especially for use in connection with the present invention.--



PATENT 



Please amend the paragraph beginning at page 16, line 25 of the specification to read as follows:

--After a nucleic acid target is selected or provided, the nucleotide sequence of the nucleic
acid target is determined and then compared to the nucleotide sequences of a plurality of nucleic 
acids from different taxonomic species. In one embodiment of the invention, the nucleotide 
sequence of the nucleic acid target is determined by scanning at least one genetic database or is 
identified in available publications. Preferred databases known and available to those skilled in the 
art include, for example, the Expressed Gene Anatomy Database (EGAD) and Unigene-Homo 
Sapiens database (Unigene), GenBank, and the like. EGAD contains a non-redundant set of human 
transcript (HT) sequences and is publicly available through the Internet at the world wide web at, for 
example, tigr.org/tdb/egad/egad.html. Unigene is a system for automatically partitioning GenBank 
sequences into a non-redundant set of gene-oriented clusters. Each Unigene cluster contains 
sequences that represent a unique gene, as well as related information such as the tissue types in 
which the gene has been expressed and map location.-- 

Please amend the paragraph beginning at page 17, line 10 of the specification to read as follows:

--In addition, Unigene contains hundreds of thousands of novel expressed sequence tag 
(EST) sequences. Unigene is publicly available through the Internet at the world wide web at, for 
example, ncbi.nlm.nih.gov/UniGene/. These databases can be used in connection with searching 
programs such as, for example, Entrez, which is known and available to those skilled in the art, and 
the like. Entrez is publicly available through the Internet at the world wide web at, for example, 
ncbi.nlm.nih.gov/Entrez. Preferably, the most complete nucleic acid sequence representation 
available from various databases is used. The GenBank database, which is known and available to 
those skilled in the art, can also be used to obtain the most complete nucleotide sequence. GenBank 
is the NIH genetic sequence database and is an annotated collection of all publicly available DNA 
sequences. GenBank is described in, for example, Nuc. Acids Res., 1998, 26, 1-7, which is
incorporated herein by reference in its entirety, and can be accessed by those skilled in the art
through the Internet at the world wide web at, for example,
ncbi.nlm.nih.gov/Web/Genbank/index.html. Alternatively, partial nucleotide sequences of nucleic
acid targets can be used when a complete nucleotide sequence is not available.--

Please amend the paragraph beginning at page 17, line 25 of the specification to read as follows:

--In another embodiment of the present invention, the nucleotide sequence of the nucleic acid
target is determined by assembling a plurality of overlapping expressed sequence tags (ESTs).
The EST database (dbEST), which is known and available to those skilled in the art, comprises
approximately one million different human mRNA sequences comprising from about 500 to 
1000 nucleotides, and various numbers of ESTs from a number of different organisms. dbEST is 
publicly available through the Internet at the world wide web at, for example, 
ncbi.nlm.nih.gov/dbEST/index.html. These sequences are derived from a cloning strategy that 
uses cDNA expression clones for genome sequencing. ESTs have applications in the discovery 
of new genes, mapping of genomes, and identification of coding regions in genomic sequences. 
Another important feature of EST sequence information that is becoming rapidly available is 
tissue-specific gene expression data. This can be extremely useful in targeting selective gene(s) 
for therapeutic intervention. Since EST sequences are relatively short, they must be assembled in
order to provide a complete sequence. Because every available clone is sequenced, a
number of overlapping regions are reported in the database.--

Please amend the paragraph beginning at page 18, line 9 of the specification to read as follows:

--Assembly of overlapping ESTs extended along both the 5' and 3' directions results in a
full-length "virtual transcript." The resultant virtual transcript may represent an already
characterized nucleic acid or may be a novel nucleic acid with no known biological function. The
Institute for Genomic Research (TIGR) Human Genome Index (HGI) database, which is known and
available to those skilled in the art, contains a list of human transcripts. TIGR is publicly available
through the Internet at the world wide web at, for example, tigr.org/. The transcripts were generated
in this manner using TIGR-Assembler, an engine to build virtual transcripts and which is known and
available to those skilled in the art. TIGR-Assembler is a tool for assembling large sets of
overlapping sequence data such as ESTs, BACs, or small genomes, and can be used to assemble
eukaryotic or prokaryotic sequences. TIGR-Assembler is described in, for example, Sutton, et al.,
Genome Science & Tech., 1995, 1, 9-19, which is incorporated herein by reference in its entirety, and
is publicly available through the Internet via file transfer program at, for example,
tigr.org/pub/software/TIGRassembler. In addition, GLAXO-MRC, which is known and available to
those skilled in the art, is another protocol for constructing virtual transcripts. In addition, the "Find
Neighbors and Assemble EST Blast" protocol, which runs on a UNIX platform, has been developed
by Applicants to construct virtual transcripts. Preferred steps in the Find Neighbors and Assemble
EST Blast protocol are described in the flowchart set forth in Figure 2. PHRAP is used for sequence
assembly within Find Neighbors and Assemble EST Blast. PHRAP is publicly available through the
Internet at, for example, chimera.biotech.washington.edu/uwgc/tools/phrap.htm. One skilled in the
art can construct source code to carry out the preferred steps set forth in Figure 2.--
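
By way of illustration only, the assembly of overlapping ESTs into a longer virtual transcript can be sketched in Python as a greedy suffix-prefix merge. This is not the TIGR-Assembler or PHRAP algorithm; the function names `overlap` and `assemble`, the exact-match overlap rule, and the minimum overlap of 3 bases are assumptions chosen to keep the example short.

```python
def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of a that equals a prefix of b (0 if below min_len)."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def assemble(fragments: list[str], min_len: int = 3) -> list[str]:
    """Greedily merge the pair with the largest exact overlap until no overlaps remain."""
    contigs = list(dict.fromkeys(fragments))  # drop exact duplicates, keep order
    # Drop fragments wholly contained in another fragment.
    contigs = [f for f in contigs if not any(f != g and f in g for g in contigs)]
    while True:
        best_n, best_i, best_j = 0, -1, -1
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_n:
                        best_n, best_i, best_j = n, i, j
        if best_n == 0:
            return contigs
        merged = contigs[best_i] + contigs[best_j][best_n:]
        contigs = [c for k, c in enumerate(contigs) if k not in (best_i, best_j)]
        contigs.append(merged)


# Hypothetical EST fragments that tile a single transcript.
ests = ["ATGGCGT", "GCGTACC", "TACCTGA"]
print(assemble(ests))
```

A practical assembler would additionally tolerate sequencing errors and consider reverse-complement orientations, which this sketch does not attempt.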

Please amend the paragraph beginning at page 19, line 23 of the specification to read as follows:

--Sequence similarity searches can be performed manually or by using several available
computer programs known to those skilled in the art. Preferably, Blast and Smith-Waterman
algorithms, which are available and known to those skilled in the art, and the like can be used. Blast
is NCBI's sequence similarity search tool designed to support analysis of nucleotide and protein
sequence databases. Blast is publicly available through the Internet at the world wide web at, for
example, ncbi.nlm.nih.gov/BLAST/. The GCG Package provides a local version of Blast that can be
used either with public domain databases or with any locally available searchable database. GCG
Package v.9.0 is a commercially available software package that contains over 100 interrelated
software programs that enable analysis of sequences by editing, mapping, comparing and aligning
them. Other programs included in the GCG Package include, for example, programs which facilitate
RNA secondary structure predictions, nucleic acid fragment assembly, and evolutionary analysis. In
addition, the most prominent genetic databases (GenBank, EMBL, PIR, and SWISS-PROT) are
distributed along with the GCG Package and are fully accessible with the database searching and
manipulation programs. GCG is publicly available through the Internet at the world wide web at, for
example, gcg.com/. Fetch is a tool available in GCG that can get annotated GenBank records based
on accession numbers and is similar to Entrez. Another sequence similarity search can be performed
with GeneWorld and GeneThesaurus from Pangea. GeneWorld 2.5 is an automated, flexible,
high-throughput application for analysis of polynucleotide and protein sequences. GeneWorld allows for
automatic analysis and annotations of sequences. Like GCG, GeneWorld incorporates several tools
for homology searching, gene finding, multiple sequence alignment, secondary structure prediction,
and motif identification. GeneThesaurus 1.0™ is a sequence and annotation data subscription
service providing information from multiple sources, providing a relational data model for public
and local data.--

Please amend the paragraph beginning at page 20, line 24 of the specification to read as
follows:

--Another toolkit capable of doing sequence similarity searching and data manipulation
is SEALS, also from NCBI. This tool set is written in Perl and C and can run on any computer
platform that supports these languages. It is publicly available through the Internet at the world
wide web at, for example, ncbi.nlm.nih.gov/Walker/SEALS/. This toolkit provides access to
Blast2 or gapped blast. It also includes a tool called tax.collector which, in conjunction with a 
tool called tax.break, parses the output of Blast2 and returns the identifier of the sequence most 
homologous to the query sequence for each species present. Another useful tool is feature2fasta 
which extracts sequence fragments from an input sequence based on the annotation. An 
exemplary use for this tool is to create sequence files containing the 5' untranslated region of a 
cDNA sequence.-- 
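
By way of illustration only, the step of returning, for each species present, the identifier of the sequence most homologous to the query can be sketched in Python as follows. This does not reproduce the actual tax.collector or tax.break tools or their input formats; the `Hit` record, its fields, and the use of the lowest E-value as the measure of "most homologous" are assumptions made for the example.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One already-parsed search hit (hypothetical record layout)."""
    identifier: str
    species: str
    e_value: float


def best_hit_per_species(hits: list[Hit]) -> dict[str, str]:
    """Map each species to the identifier of its lowest-E-value hit."""
    best: dict[str, Hit] = {}
    for hit in hits:
        current = best.get(hit.species)
        if current is None or hit.e_value < current.e_value:
            best[hit.species] = hit
    return {species: hit.identifier for species, hit in best.items()}


# Hypothetical identifiers and scores, for illustration only.
hits = [
    Hit("seq_h1", "Homo sapiens", 1e-80),
    Hit("seq_h2", "Homo sapiens", 1e-20),
    Hit("seq_m1", "Mus musculus", 1e-60),
]
print(best_hit_per_species(hits))
```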

Please amend the paragraph beginning at page 21, line 30 of the specification to read as follows:

--In another embodiment of the invention, the sequences required are obtained by
searching ortholog databases. One such database is Hovergen, which is a curated database of 
vertebrate orthologs. Ortholog sets may be exported from this database and used as is, or used as 
seeds for further sequence similarity searches as described above. Further searches may be 
desired, for example, to find invertebrate orthologs. Hovergen is publicly available through the 
Internet via file transfer program at, for example, pbil.univ-lyon1.fr/pub/hovergen/. A database of
prokaryotic orthologs, COGS, is available and can be used interactively through the Internet at
the world wide web at, for example, ncbi.nlm.nih.gov/COG/.--

Please amend the paragraph beginning at page 23, line 22 of the specification to read as follows:

--Sequence homology between the window sequence of the target nucleic acid and the
query sequence of any of the plurality of nucleic acid sequences obtained as described above is
preferably at least 60%, more preferably at least 70%, more preferably at least 80%, and most
preferably at least 90%. The most preferable method of choosing the threshold is to have the
computer automatically try all thresholds from 50% to 100% and choose a threshold based on a
metric provided by the user. One such metric is to pick the threshold such that exactly n hits are
returned, where n is usually set to 3. This process is repeated until every base on the query 
nucleic acid, which is a member of the plurality of nucleic acids described above, has been 
compared to every base on the master target sequence. The resulting scoring matrix can be 
plotted as a scatter plot. Based on the match density at a given location, there may be no dots, 
isolated dots, or a set of dots so close together that they appear as a line. The presence of lines, 
however small, indicates primary sequence homology. Sequence conservation within nucleic 
acid molecules, particularly the UTRs of RNA, in divergent species is likely to be an indicator of 
conserved regulatory elements that are also likely to have a secondary structure. The results of 
the interspecies sequence comparison can be analyzed using MS Excel and Visual Basic tools in
an entirely automated manner as known to those skilled in the art.--
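
By way of illustration only, the window comparison and the automatic choice of a threshold returning exactly n hits can be sketched in Python as follows. The function names, the use of percent identity as the homology measure, and the choice of the lowest qualifying threshold when several would return n hits are assumptions; the specification does not fix these details.

```python
def window_identity(a: str, b: str) -> float:
    """Percent identity between two equal-length windows."""
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def window_scores(target: str, query: str, size: int) -> dict[tuple[int, int], float]:
    """Score every target window against every query window (the scoring matrix)."""
    scores = {}
    for i in range(len(target) - size + 1):
        for j in range(len(query) - size + 1):
            scores[(i, j)] = window_identity(target[i:i + size], query[j:j + size])
    return scores


def pick_threshold(scores: dict[tuple[int, int], float], n_hits: int = 3):
    """Try thresholds from 50% to 100%; return the lowest one giving exactly n_hits hits."""
    for threshold in range(50, 101):
        hits = sorted(pos for pos, score in scores.items() if score >= threshold)
        if len(hits) == n_hits:
            return threshold, hits
    return None  # no threshold in the range yields exactly n_hits
```

The positions returned as hits are the dots of the scatter plot described above; runs of hits along a diagonal correspond to the lines that indicate primary sequence homology.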



Please amend the paragraph beginning at page 23, line 22 of the specification to read as follows:

--In another embodiment of the invention, secondary structure analysis is performed by
self complementarity comparison. Preferably, self complementarity comparison is performed 
using Compare, described above. More preferably, Compare can be modified to expand the 
pairing matrix to account for G-U or U-G basepairs in addition to the conventional Watson- 
Crick G-C/C-G or A-U/U-A pairs. Such a modified Compare program (modified Compare) 
begins by predicting all possible base-pairings within a given sequence. As described above, a 
small but conserved region, preferably a UTR, is identified based on primary sequence 
comparison of a series of orthologs. In modified Compare, each of these sequences is compared 
to its own reverse complement. Allowable base-pairings include Watson-Crick A-U, G-C 
pairing and non-canonical G-U pairing. An overlay of such self complementarity plots of all 
available orthologs, and selection for the most repetitive pattern in each, results in a minimal 
number of possible folded configurations. These overlays can then be used in conjunction with
additional constraints, including those imposed by energy considerations described above, to 
deduce the most likely secondary structure.-- 
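
By way of illustration only, a self complementarity comparison with a pairing matrix extended to G-U and U-G pairs can be sketched in Python as follows. This is not the Compare program or its modified version; testing base pairs directly between positions of one sequence is used here as an equivalent of comparing the sequence with its own reverse complement, and the function names, minimum stem length, and minimum loop size are assumptions.

```python
# Watson-Crick pairs plus the non-canonical G-U wobble pairs.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def pairing_matrix(seq: str) -> list[list[bool]]:
    """matrix[i][j] is True when base i can pair with base j."""
    n = len(seq)
    return [[(seq[i], seq[j]) in PAIRS for j in range(n)] for i in range(n)]


def find_stems(seq: str, min_len: int = 3, min_loop: int = 3) -> list[tuple[int, int, int]]:
    """Return (i, j, length): stacked pairs (i, j), (i+1, j-1), ... of at least min_len."""
    m = pairing_matrix(seq)
    n = len(seq)
    stems = []
    for i in range(n):
        for j in range(i + min_loop + 1, n):
            if not m[i][j]:
                continue
            # Report a run only from its outermost pair.
            if i > 0 and j + 1 < n and m[i - 1][j + 1]:
                continue
            length = 0
            # Extend inward while at least min_loop unpaired bases remain between the pair.
            while i + length < j - length - min_loop and m[i + length][j - length]:
                length += 1
            if length >= min_len:
                stems.append((i, j, length))
    return stems


print(find_stems("GGGAAAACCC"))
```

Running the same search on each ortholog and overlaying the reported stems is one way to look for the recurring pattern described above.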



Please amend the paragraph beginning at page 24, line 17 of the specification to read as follows:

--In one embodiment of the invention, secondary structure analysis is performed by
alignment and covariance analysis. Numerous protocols for alignment and covariance analysis are
known to those skilled in the art. Preferably, alignment is performed by ClustalW, which is available
and known to those skilled in the art. ClustalW is a tool for multiple sequence alignment that,
although not a part of GCG, can be added as an extension of the existing GCG tool set and used with
local sequences. ClustalW is publicly available through the Internet at, for example,
dotimgen.bcm.tmc.edu:9331/multialign/Options/clustalw.html. ClustalW is also described in
Thompson et al., Nuc. Acids Res., 1994, 22, 4673-4680, which is incorporated herein by reference in
its entirety. These processes can be [...] earlier steps. Seqed, a UNIX command line interface
available and known to those skilled in the art, allows extraction of selected local regions from a
larger sequence. Multiple sequences from many different species can be clustered and aligned for
further analysis.--



Please amend the paragraph beginning at page 25, line 11 of the specification to read as follows:

--Covariation is a process of using phylogenetic analysis of primary sequence information
for consensus secondary structure prediction. Covariation is described in the following references,
each of which is incorporated herein by reference in its entirety: Gutell, et al., "Comparative
Sequence Analysis Of Experiments Performed During Evolution" In Ribosomal RNA Group I
Introns, Green, Ed., Austin: Landes, 1996; Gautheret, et al., Nuc. Acids Res., 1997, 25, 1559-1564;
Gautheret, et al., RNA, 1995, 1, 807-814; Udmell, et al., Proc. Natl. Acad. Sci. USA, 1995, 92,
10555-10559; Gautheret, et al., J. Mol. Biol., 1995, 248, 27-43; Gutell, Nuc. Acids Res., 1994, 22,
3502-3517; Gutell, Nuc. Acids Res., 1993, 21, 3055-3074; Gutell, Nuc. Acids Res., 1993, 21,
3051-3054; Woese, Proc. Natl. Acad. Sci. USA, 1989, 86, 3119-3122; and Woese, et al., Nuc. Acids Res.,
1980, 8, 2275-2293. Preferably, covariance software is used for covariance analysis. Preferably,
Covariation, a set of programs for the comparative analysis of RNA structure from sequence
alignments, is used. Covariation uses phylogenetic analysis of primary sequence information for
consensus secondary structure prediction. Covariation is publicly available through the Internet at
the world wide web at, for example, mbio.ncsu.edu/RNaseP/info/programs/programs.html. A
complete description of a version of the program has been published (Brown, J. W. 1991
Phylogenetic analysis of RNA structure on the Macintosh computer. CABIOS 7:391-393). The
current version is v4.1, which can perform various types of covariation analysis from RNA sequence
alignments, including standard covariation analysis, the identification of compensatory base-changes,
and mutual information analysis. The program is well-documented and comes with extensive
example files. It is compiled as a stand-alone program; it does not require HyperCard (although a
much smaller 'stack' version is included). This program will run in any Macintosh environment
running MacOS v7.1 or higher. Faster processor machines (68040 or PowerPC) are suggested for
mutual information analysis or the analysis of large sequence alignments.--
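
By way of illustration only, the mutual information analysis mentioned above can be sketched in Python for a pair of alignment columns. This is a generic textbook calculation and not the code of the Covariation program; the function name and the representation of the alignment as a list of equal-length strings are assumptions.

```python
from collections import Counter
from math import log2


def mutual_information(alignment: list[str], i: int, j: int) -> float:
    """Mutual information (in bits) between columns i and j of an alignment."""
    n = len(alignment)
    col_i = Counter(seq[i] for seq in alignment)
    col_j = Counter(seq[j] for seq in alignment)
    pairs = Counter((seq[i], seq[j]) for seq in alignment)
    mi = 0.0
    for (a, b), count in pairs.items():
        p_ab = count / n
        mi += p_ab * log2(p_ab / ((col_i[a] / n) * (col_j[b] / n)))
    return mi


# Hypothetical four-sequence alignment: columns 0 and 3 vary together so that
# a base pair is always possible, while column 1 is invariant.
alignment = ["GAAC", "CAAG", "AAAU", "UAAA"]
print(mutual_information(alignment, 0, 3))
print(mutual_information(alignment, 0, 1))
```

High mutual information between two columns suggests compensatory base changes, whereas an invariant column carries no covariation signal.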

Please amend the paragraph beginning at page 26, line 5 of the specification to read as follows:

--In another embodiment of the invention, secondary structure analysis is performed by
secondary structure prediction. There are a number of algorithms that predict RNA secondary
structures based on thermodynamic parameters and energy calculations. Preferably, secondary
structure prediction is performed using either M-fold or RNA Structure 2.52. M-fold is publicly
available through the Internet at the world wide web at, for example,
ibc.wustl.edu/~zuker/rna/form2.cgi or can be downloaded for local use on UNIX platforms. M-fold is
also available as a part of the GCG package. RNA Structure 2.52 is a Windows adaptation of the
M-fold algorithm and is publicly available through the Internet at, for example,
128.151.176.70/RNAstructure.html.--



Please amend the paragraph beginning at page 26, line 29 of the specification to read as follows:

--In another preferred embodiment of the invention, the output of AlignHits is read by a
program called RevComp. A block diagram of this program is shown in Figure 9. This program
could be reproduced by one skilled in the art. A preferred purpose of this program is to use base
pairing rules and ortholog evolution to predict RNA secondary structure. RNA secondary
structures are composed of single stranded regions and base paired regions, called stems. Since
structure conserved by evolution is searched, the most probable stem for a given alignment of
ortholog sequences is the one which could be formed by the most sequences. Possible stem
formation or base pairing rules are determined by, for example, analyzing base pairing statistics of
stems which have been determined by other techniques such as NMR. The output of RevComp
is a sorted list of possible structures, ranked by the percentage of ortholog set member sequences
which could form this structure. Because this approach uses a percentage threshold approach, it
is insensitive to noise sequences. Noise sequences are those that are either not true orthologs, or
sequences that made it into the output of AlignHits due to high sequence homology even though
they do not represent an example of the structure which is searched. A very similar algorithm is
implemented using Visual Basic for Applications (VBA) and Microsoft Excel to be run on PCs,
to generate the reverse complement matrix view for the given set of sequences.--
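
By way of illustration only, the ranking of candidate stems by the percentage of ortholog sequences able to form them can be sketched in Python as follows. This is not the RevComp program shown in Figure 9; the representation of a stem as a list of paired alignment columns, the pairing rules, and the names `can_form` and `rank_stems` are assumptions.

```python
# Allowed pairs: Watson-Crick plus G-U wobble (an assumed rule set).
PAIRS = {"AU", "UA", "GC", "CG", "GU", "UG"}


def can_form(seq: str, stem: list[tuple[int, int]]) -> bool:
    """True when every paired column (i, j) of the stem holds a permitted base pair."""
    return all(seq[i] + seq[j] in PAIRS for i, j in stem)


def rank_stems(alignment, stems, min_fraction=0.0):
    """Sort stems by the fraction of aligned sequences able to form them, highest first."""
    ranked = []
    for stem in stems:
        support = sum(can_form(seq, stem) for seq in alignment) / len(alignment)
        if support >= min_fraction:
            ranked.append((support, stem))
    ranked.sort(key=lambda item: item[0], reverse=True)
    return ranked


# Hypothetical aligned orthologs; the last one plays the role of a noise sequence.
alignment = ["GGGAAAACCC", "GGAAAAAUCC", "GCGAAAACGU", "AAAAAAAAAA"]
stem_a = [(0, 9), (1, 8), (2, 7)]
stem_b = [(0, 6), (1, 5)]
print(rank_stems(alignment, [stem_b, stem_a]))
```

Because a stem is scored by the share of sequences supporting it, a single noise sequence lowers the percentage without removing the stem from the ranked list.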

Please amend the paragraph beginning at page 28, line 13 of the specification to read as follows:

--After the hypothetical structure motifs are determined from the secondary structure
analysis described above, a family of structure descriptor elements is constructed. Preferably, the
structural motifs described above are converted into a family of descriptor elements. An
exemplary descriptor element is shown in Figure 6. One skilled in the art is familiar with
construction of descriptors. Structure descriptors are described in, for example, Laferriere, et al.,
Comput. Appl. Biosci., 1994, 10, 211-212, incorporated herein by reference in its entirety. A
different structure descriptor element is constructed for each of the structural motifs identified
from the secondary structure analysis. Briefly, the secondary structure is converted to a generic
text string, such as shown in Figure 6. For novel motifs, further biochemical analysis such as
chemical mapping or mutagenesis may be needed to confirm structure predictions. Descriptor
elements may be defined to have various stringency.--

Please amend the paragraph beginning at page 28, line 24 of the specification to read as follows:

--For example, referring to Figure 6, the region termed H1, which comprises the first
region of the stem, can be described as NNRNNN, which contemplates any complementary base
pairing including G-C, C-G, A-U, and U-A. The H1 region may also be designated so as to
include only C-G or A-U, etc., base pairing. In addition, the descriptor elements can be defined
to allow for a wobble. Thus, descriptor elements can be defined to have any level of stringency
desired by the user. Applicants' invention, thus, is also directed to a database comprising
different descriptor elements.--

Please amend the paragraph beginning at page 29, line 10 of the specification to read as
follows:

--In one embodiment of the invention, nucleic acids having secondary structure which
correspond to the structure descriptor elements are identified by searching at least one database. 
Any genetic database can be searched. Preferably, the database is a UTR database, which is a 
compilation of the untranslated regions in messenger RNAs. A UTR database is publicly 
available through the Internet via file transfer program at area.ba.cnr.it/pub/embnet/database/utr/. 
Preferably the database is searched using a computer program, such as, for example, Rnamot, a 
UNIX-based motif searching tool available from Daniel Gautheret. Each "new" sequence that 
has the same motif is then queried against public domain databases to identify additional 
sequences. Results are analyzed for recurrence of pattern in UTRs of these additional ortholog 
sequences, as described below, and a database of RNA secondary structures is built. One skilled 
in the art is familiar with Rnamot. Briefly, Rnamot takes a descriptor string, such as the one 
shown in Figure 6, and searches any Fasta format database for possible matches. Descriptors can 
be very specific, to match exact nucleotide(s), or can have built-in degeneracy. Lengths of the
stem and loop can also be specified. Single stranded loop regions can have a variable length. G- 
U pairings are allowed and can be specified as a wobble parameter. Allowable mismatches can 
also be included in the descriptor definition. Functional significance is assigned to the motifs if 
their biological role is known based on previous analysis. Known regulatory regions such as the Iron
Response Element have been found using this technique (see, Example 1 below). In
embodiments of the invention in which a database containing prokaryotic molecular interaction
sites is compiled, it is preferable to refrain from searching human sequences or, alternatively,
to discard human sequences when found.--
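
The descriptor-driven search described above can be illustrated with a minimal Python sketch. This is not Rnamot and does not use its descriptor syntax; the function names, parameters, and the simple hairpin model (one stem, one loop) are assumptions made purely for illustration. It shows how stem length, a variable loop length, a wobble option, and an allowable mismatch count can together set the stringency of a search over Fasta-format records.

```python
# Hypothetical illustration only: not Rnamot, and not its descriptor syntax.
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
WOBBLE = {("G", "U"), ("U", "G")}


def can_pair(a, b, allow_wobble=True):
    """Return True if bases a and b may pair under the chosen stringency."""
    return (a, b) in WATSON_CRICK or (allow_wobble and (a, b) in WOBBLE)


def find_stem_loops(seq, stem_len, loop_min, loop_max,
                    allow_wobble=True, max_mismatches=0):
    """Yield (start, loop_len) for every hairpin that satisfies the descriptor."""
    seq = seq.upper().replace("T", "U")
    for start in range(len(seq)):
        for loop_len in range(loop_min, loop_max + 1):
            end = start + 2 * stem_len + loop_len
            if end > len(seq):
                break
            five = seq[start:start + stem_len]
            three = seq[end - stem_len:end]
            # The 5' strand pairs with the 3' strand read in reverse.
            mismatches = sum(
                not can_pair(a, b, allow_wobble)
                for a, b in zip(five, reversed(three))
            )
            if mismatches <= max_mismatches:
                yield start, loop_len


def read_fasta(lines):
    """Parse Fasta-formatted lines into (header, sequence) tuples."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif line:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)
```

Calling `find_stem_loops` with `allow_wobble=False` rejects stems that rely on G-U pairs, while raising `max_mismatches` loosens the match, which mirrors the adjustable stringency described for descriptor elements.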
Please amend the paragraph beginning at page 32, line 26 of the specification to read as
follows:

--An early step in the process is to use the master sequence (nucleotide or protein) to
find and rank related sequences in the database (orthologs and paralogs). Sequence similarity
search algorithms are used for this purpose. All sequence similarity algorithms calculate a
quantitative measure of similarity for each result compared with the master sequence. An
example of a quantitative result is an E-value obtained from the Blast algorithm. The E-values
[...] sequence illustrates the use of quantitative analysis of sequence similarity searches. The E-value
is the probability that a match between a query sequence and a database sequence occurs due to
random chance. Therefore, the lower an E-value the more likely that two sequences are truly
related. A plot of the lowest E-value scores for ferritin is shown in Figure 7. Sequences that [...]
and paralogs, it is preferable that the cutoff criteria not be too stringent, or the target of the search
may be excluded.--
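
As an illustration of ranking search results by E-value and applying a cutoff, the following minimal Python sketch may help. The function name, the default cutoff, and the sequence identifiers are hypothetical and are not taken from the specification; real search output would first have to be parsed into (identifier, E-value) pairs.

```python
def rank_hits(hits, evalue_cutoff=1e-3):
    """Keep hits whose E-value is at or below the cutoff, best (lowest) first.

    hits is an iterable of (sequence_id, evalue) pairs.
    """
    kept = [(seq_id, evalue) for seq_id, evalue in hits if evalue <= evalue_cutoff]
    return sorted(kept, key=lambda hit: hit[1])


# Hypothetical identifiers and scores, for illustration only.
example_hits = [
    ("distant_ortholog", 1e-5),
    ("close_ortholog", 1e-50),
    ("unrelated", 0.5),
]
permissive = rank_hits(example_hits, evalue_cutoff=1e-3)
stringent = rank_hits(example_hits, evalue_cutoff=1e-10)
```

In this example the stringent cutoff drops the distant ortholog while the permissive one keeps it, which is the risk of an overly strict cutoff noted in the text.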

Please amend the paragraph beginning at page 33, line 23 of the specification to read as
follows:

--When the human mRNA and mouse mRNA sequences for ferritin, which each contain
an IRE in the 5'-UTR, are analyzed in this manner, a plot showing the regions of sequence
similarity is produced. Pairwise analysis of the human and mouse ferritin mRNA sequences
[...] end of each mRNA. This demonstrates an important point - the sequence conservation in the
region of the IRE structure does not stand out against the background of sequence similarity
between the human and mouse ferritin sequences. In contrast, in the comparison of human and
trout or human and chicken ferritin mRNAs, the IREs can be immediately identified. This is
because the UTR sequences of human and trout or human and chicken are separated
by greater evolutionary distance than those of human and mouse, which is logical in view of the
evolutionary distance that separates humans from birds and fish compared with other mammals.
Comparing the human sequence to that of birds and fish is informative because the natural drift
due to evolution has allowed many sequence changes in the UTRs. However, the IRE sequences
are more constrained because they form an important structure. Thus, they stand out better and
can be more readily identified.--

Please amend the paragraph beginning at page 34, line 28 of the specification to read as 
follows: 

--The software used in the present invention makes the decision whether or not to
compare sequences pairwise using a lookup table based upon the evolutionary distances between
species. An example of a small lookup table using the examples described above is shown in
Figure 8. The lookup table in the present invention includes all species that have sequences
deposited in GenBank. Q-Compare in conjunction with CompareOverWins decides which
sequences to compare pairwise.--
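
A lookup-table decision of this kind can be sketched in a few lines of Python. The distance values, the threshold, and the function names below are hypothetical placeholders and do not reproduce Figure 8 or the actual Q-Compare logic; the sketch only assumes that pairs of species separated by too little evolutionary distance are skipped.

```python
import itertools

# Hypothetical distance classes (larger means more distant); not the values of Figure 8.
DISTANCE = {
    frozenset(("human", "mouse")): 1,
    frozenset(("human", "chicken")): 3,
    frozenset(("human", "trout")): 4,
    frozenset(("mouse", "chicken")): 3,
    frozenset(("mouse", "trout")): 4,
    frozenset(("chicken", "trout")): 4,
}


def should_compare(species_a, species_b, min_distance=2):
    """Return True if the two species are distant enough to compare pairwise."""
    if species_a == species_b:
        return False
    # Pairs missing from the table are treated as distance 0 and skipped.
    return DISTANCE.get(frozenset((species_a, species_b)), 0) >= min_distance


def pairs_to_compare(species, min_distance=2):
    """List every species pair that the lookup table marks as worth comparing."""
    return [
        (a, b)
        for a, b in itertools.combinations(species, 2)
        if should_compare(a, b, min_distance)
    ]
```

Keying the table on a `frozenset` makes the lookup symmetric, so the order in which two species are supplied does not matter.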

Please amend the paragraph beginning at page 35, line 4 of the specification to read as
follows:

--Sets of sequences that show evidence of conservation in orthologs and paralogs or other
related genes are analyzed for the ability to form internal structure. This is accomplished by
analyzing each sequence in a matrix where the sequence is plotted 5' to 3' on the X axis and its
reverse complement is plotted 5' to 3' on the Y axis, such as in, for example, self-complementary
analysis. Matches that correspond to potential intramolecular base pairs are scored according to a
table of values. When the human ferritin IRE sequence is analyzed in this fashion, the diagonals
indicate potential self-complementary regions. Each of the 13 IRE sequences described in this
example was analyzed in the same fashion. While each of the sequences can form a variety of
different structures, the structure most likely to occur is one common to all the sequences. By
superimposing the plots of all 13 individual sequences, the potential structure common to all the
sequences is deduced.--
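
The matrix analysis described above can be sketched in Python as follows. The score table, the minimum loop size, and the function names are assumptions made for illustration and are not the values or code used by the applicants. Row `y` of the matrix corresponds to position `n - 1 - y` of the sequence, which is equivalent to plotting the reverse complement 5' to 3' on the Y axis, so a potential stem appears as a run of scored cells along a diagonal.

```python
# Hypothetical score table: G-C > A-U > G-U wobble; any other combination scores 0.
PAIR_SCORE = {
    ("G", "C"): 3, ("C", "G"): 3,
    ("A", "U"): 2, ("U", "A"): 2,
    ("G", "U"): 1, ("U", "G"): 1,
}


def pairing_matrix(seq):
    """Score every (x, y) cell; column x is base x, row y is base n-1-y."""
    s = seq.upper().replace("T", "U")
    n = len(s)
    return [[PAIR_SCORE.get((s[x], s[n - 1 - y]), 0) for x in range(n)]
            for y in range(n)]


def diagonal_stems(matrix, min_len=3, min_loop=3):
    """Return (i, j, length) stems: bases i, i+1, ... pair with j, j-1, ..."""
    n = len(matrix)
    stems = []
    for offset in range(-(n - 1), n):          # offset = x - y along one diagonal
        lo, hi = max(0, offset), min(n, n + offset)
        run = 0
        for x in range(lo, hi + 1):            # one extra step flushes a final run
            partner = n - 1 - (x - offset)
            paired = (x < hi and partner - x > min_loop
                      and matrix[x - offset][x] > 0)
            if paired:
                run += 1
            else:
                if run >= min_len:
                    start = x - run
                    stems.append((start, n - 1 - (start - offset), run))
                run = 0
    return stems


def superimpose(matrices):
    """Count, per cell, how many equally sized matrices show a possible pair."""
    n = len(matrices[0])
    return [[sum(m[y][x] > 0 for m in matrices) for x in range(n)]
            for y in range(n)]
```

The `superimpose` helper assumes the sequences have already been brought to a common length (for example, by alignment); cells with high counts then mark pairings shared by most of the sequences.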

Please amend the paragraph beginning at page 36, line 5 of the specification to read as 
follows: 

--Hovergen was used to identify related sequences at the species and order levels.
Sequences corresponding to each of these orthologs were saved in GenBank format and grouped
together in a single data file. Untranslated regions in both the 5' and 3' flanks of the coding
region were extracted using SEALS and COWX, as shown in Figure 11.--


Please amend the paragraph beginning at page 36, line 11 of the specification to read as
follows:

--The IRE sequences are more constrained because they form an important structure.
Thus, they stand out better and can be more readily identified even in closely related sequences.
However, for this to work for any gene, the compare algorithm has been rewritten (see, Figures
5A-C). This new tool, CompareOverWins, allows a dynamic selection of both the range of
window sizes and the hit threshold. This algorithm needs as its input parsed and separated 5'
and 3' UTR sequences. We use tools available within the SEALS genome analysis package
described earlier to achieve this. Figure 11 describes the steps involved.--
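
A windowed comparison with a selectable range of window sizes and a hit threshold can be sketched as below. This is not the CompareOverWins code of Figures 5A-C; the function name, the ungapped identity score, and the example sequences are assumptions chosen only to show how the two parameters interact.

```python
def compare_over_windows(seq_a, seq_b, window_sizes, hit_threshold):
    """Yield (window, i, j, identity) for ungapped windows at or above the threshold.

    i and j are 0-based start positions in seq_a and seq_b; identity is the
    fraction of matching positions inside the window.
    """
    for w in window_sizes:
        for i in range(len(seq_a) - w + 1):
            for j in range(len(seq_b) - w + 1):
                matches = sum(a == b for a, b in zip(seq_a[i:i + w], seq_b[j:j + w]))
                identity = matches / w
                if identity >= hit_threshold:
                    yield w, i, j, identity


# Hypothetical UTR fragments sharing one conserved 6-mer.
utr_a = "UUUUCAGUGCAAAA"
utr_b = "GGCAGUGCGG"
hits = list(compare_over_windows(utr_a, utr_b, window_sizes=[6], hit_threshold=1.0))
# Extract the sequence corresponding to each hit, as the text describes.
hit_sequences = [utr_a[i:i + w] for w, i, j, identity in hits]
```

This brute-force loop is quadratic in sequence length for each window size, which is acceptable for short UTRs but would need indexing for genome-scale input.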

Please amend the paragraph beginning at page 36, line 19 of the specification to read as 
follows: 

--To identify the IRE using the methods described herein, the compare over windows
algorithm was used and the results visualized using AlignHits (Figure 5D for the algorithm). In
addition to optimizing the thresholding, CompareOverWins also extracts the sequence
corresponding to the hits. ClustalW (version 1.74) was used on the extracted sequences to create
a locally gapped alignment. A representative flow scheme for this approach is shown in Figure
13.--



Please amend the paragraph beginning at page 36, line 27 of the specification to read as
follows:

--Sets of sequences that show evidence of conservation in orthologs and paralogs or other
related genes are analyzed for the ability to form internal structure. This is accomplished by
analyzing each sequence in a matrix where the sequence is plotted 5' to 3' on the X axis and its
complement is plotted 5' to 3' on the Y axis, such as in, for example, self-complementary
analysis. Matches that correspond to potential intramolecular base pairs are scored according to a
table of values. When the human ferritin IRE sequence is analyzed in this fashion, the diagonals
indicate potential self-complementary regions. Each of the 13 IRE sequences described in this
example was analyzed in the same fashion. While each of the sequences can form a variety of
different structures, the structure most likely to occur is one common to all the sequences. By
superimposing the plots of all 13 individual sequences, the potential structure common to all the
sequences is deduced.--

Please amend the paragraph beginning at page 37, line 9 of the specification to read as
follows:

--The above scheme has been implemented algorithmically into a program called
RevComp (see, Figure 9). RevComp creates a sorted list of all the structures. Representative
results can be viewed either as a "dome" output or as a "connect" or "ct" file which can be used
in one of many RNA structure viewing programs (RNAStructure, RNAViz, etc.). A
representative example of such a structure drawing is shown in Figure 14.--
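
For readers unfamiliar with connect ("ct") files, the following Python sketch writes a list of base pairs in the commonly used connect-table column layout (index, base, previous index, next index, pairing partner, natural numbering). It is not RevComp, and the layout of the "dome" output is not assumed here; the function name and the input representation (1-based pairs) are hypothetical.

```python
def to_ct(seq, pairs, title="structure"):
    """Render a sequence and its 1-based base pairs as connect-table text."""
    n = len(seq)
    partner = [0] * (n + 1)            # 0 means the base is unpaired
    for i, j in pairs:
        partner[i] = j
        partner[j] = i
    lines = [f"{n:5d} {title}"]
    for i, base in enumerate(seq, start=1):
        next_index = i + 1 if i < n else 0
        lines.append(
            f"{i:5d} {base} {i - 1:5d} {next_index:5d} {partner[i]:5d} {i:5d}"
        )
    return "\n".join(lines) + "\n"


# A two-pair hairpin: bases 1-7 and 2-6 are paired, bases 3-5 form the loop.
ct_text = to_ct("GCAAAGC", [(1, 7), (2, 6)])
```

The exact spacing and header conventions expected by a given viewing program may differ, so the output should be checked against that program's documentation before use.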



Please amend the paragraph beginning at page 37, line 21 of the specification to read as
follows:

--Phylogenetic tree outputs for all Histone orthologs in Hovergen database were obtained.
Each of these orthologs was saved in GenBank format and grouped together in a single data file.
Untranslated regions in both the 5' and 3' flanks of the coding regions were extracted and
compared using SEALS and COWX as described earlier (see, Figures 11 and 13).--



Please amend the paragraph beginning at page 37, line 26 of the specification to read as
follows:

--Following extraction and comparison by SEALS and COWX, Align Hits was used to
determine potentially interesting regions. One such region is shown encircled]. The sequences
corresponding to the region of interest were extracted from all species for alignment with
CLUSTAL W (1.74). Following extraction of sequence information from Align Hits, CLUSTAL
W (1.74) was used to provide the multiple sequence alignment shown. Each of the putative hit
sequences was analyzed for the ability to form internal structure. This was accomplished by
analyzing each sequence in a matrix where the sequence was plotted 5' to 3' on the X axis and its
complement was plotted 5' to 3' on the Y axis. Base-pairs along the diagonals indicate potential
self-complementary regions that can form secondary structures. A representative sequence
alignment in a dome format can show potential stem formation between the base pairs.
Following conversion of the dome format file to a ct file, RNA Structure 3.21 is used to visualize
the structure.--

Please amend the paragraph beginning at page 38, line 10 of the specification to read as
follows:

--Vimentin is an intermediate filament protein whose 3'UTR is highly conserved between
species. Previous studies by Zehner et al. (Nuc. Acids Res., 1997, 25, 3362-3370) have shown
that a proposed complex stem-loop structure contained within this region may be important for
vimentin mRNA functions such as mRNA localization. The same region was identified using
the present analysis, thus validating the present approach. In addition, based on the analyses
described herein, a second stem-loop structure, which occurs downstream of the previously
proposed structure and may also have a role in regulating vimentin function, has been
identified.--



Please amend the paragraph beginning at page 38, line 18 of the specification to read as
follows:

--A representative phylogenetic tree output for all Vimentin orthologs in Hovergen
database was obtained. Each of these orthologs was saved in GenBank format and grouped
together in a single data file. Untranslated regions in both the 5' and 3' flanks of the coding
regions were extracted and compared using SEALS and COWX as described earlier (see, Figures
11 and 13).--

Please amend the paragraph beginning at page 38, line 23 of the specification to read as
follows:

--Following extraction and comparison by SEALS and COWX, Align Hits was used to
define potentially interesting regions. Two such regions appeared, and were used for
subsequent analyses. Following extraction of sequence information from Align Hits for the first
region, CLUSTAL W was used to provide multiple sequence alignment. Potential stem
formation between base pairs was given above the sequence alignment in a dome format.
Following conversion of the dome format file to a ct file, RNA Structure 3.21 was used to
visualize the structure. This structure is very similar to the one proposed by Zehner et al.
Zehner et al. presented a detailed chemical analysis of their proposed structure for the minimal
binding domain in the 3' UTR of Vimentin. This analysis included cleavage with single-strand-
specific (ChS or T1) or double-strand-specific (V1) nucleases as well as after exposure to lead
acetate.--

Please amend the paragraph beginning at page 39, line 5 of the specification to read as
follows:

--Following extraction of sequence information from Align Hits for the second region,
CLUSTAL W was used to provide multiple sequence alignment. The potential stem formation
between base pairs in the second region was given above the sequence alignment in a dome
format. Following conversion of the dome format file to a ct file, RNA Structure 3.21 was used
to visualize the structure for the second region.--

Please amend the paragraph beginning at page 39, line 18 of the specification to read as
follows:

--A representative phylogenetic tree output for all Transferrin receptor orthologs in
Hovergen database was obtained. Each of these orthologs was saved in GenBank format and
grouped together in a single data file. Untranslated regions in both the 5' and 3' flanks of the
coding region were extracted and compared using SEALS and COWX as described earlier (see,
Figures 11 and 13).--

Please amend the paragraph beginning at page 39, line 23 of the specification to read as
follows:

--Following extraction and comparison by SEALS and COWX, Align Hits was used to
determine potentially interesting regions. The first region, between base pairs 920 to 990 in the 3
prime UTR of transferrin receptor was extracted from all species for alignment with CLUSTAL
W (1.74).--

Please amend the paragraph beginning at page 39, line 28 of the specification to read as
follows:

--Following extraction of sequence information from Align Hits for the first region,
CLUSTAL W (1.74) was used to provide multiple sequence alignment. A representative
potential stem formation between base pairs was given above the sequence alignment in a dome
format. Following conversion of the dome format file to a ct file, RNA Structure 3.21 was used
to visualize the structure. This can be seen where a vertical line intersects a series of horizontal
lines representing sequence information from a set of species. This] The second region, between
base pairs 990 to 1050, in the 3 prime UTR of transferrin receptor was extracted from all species
for alignment with CLUSTAL W (1.74).--



Please amend the paragraph beginning at page 40, line 7 of the specification to read as
follows:

--Following extraction of sequence information from Align Hits for the second region,
CLUSTAL W (1.74) was used to provide multiple sequence alignment. Potential stem formation 
between base pairs was given above the sequence alignment in a dome format. Following 
conversion of the dome format file to a ct file, RNA Structure 3.21 was used to visualize the 
structure. Following extraction and comparison by SEALS and COWX, Align Hits was used to 
determine potentially interesting regions. The third region, between base pairs 1372 to 1423, in 
the 3 prime UTR of transferrin receptor was extracted from all species for alignment with 
CLUSTAL W (1.74).-- 



Please amend the paragraph beginning at page 40, line 17 of the specification to read as
follows:

--Following extraction of sequence information from Align Hits for the third region,
CLUSTAL W (1.74) was used to provide multiple sequence alignment. Potential stem
formation between base pairs was given above the sequence alignment in a dome format.
Following conversion of the dome format file to a ct file, RNA Structure 3.21 was used to
visualize the structure. Following extraction and comparison by SEALS and COWX, Align Hits
was used to determine potentially interesting regions. The fourth region, between base pairs
1439 to 1479, in the 3 prime UTR of transferrin receptor was extracted from all species for
alignment with CLUSTAL W (1.74).--

Please amend the paragraph beginning at page 40, line 27 of the specification to read as 
follows: 
--Following extraction of sequence information from Align Hits for the fourth region,
CLUSTAL W (1.74) was used to provide multiple sequence alignment. Potential stem
formation between base pairs was given above the sequence alignment in a dome format.
Following conversion of the dome format file to a ct file, RNA Structure 3.21 was used to
visualize the structure. Following extraction and comparison by SEALS and COWX, Align Hits
was used to determine potentially interesting regions. The fifth region, between base pairs 1479
to 1542, in the 3 prime UTR of transferrin receptor was extracted from all species for alignment
with CLUSTAL W (1.74).--

Please amend the paragraph beginning at page 41, line 6 of the specification to read as 
follows: 

--Following extraction of sequence information from Align Hits for the fifth region,
CLUSTAL W (1.74) was used to provide multiple sequence alignment. Potential stem
formation between base pairs was given above the sequence alignment in a dome format.
Following conversion of the dome format file to a ct file, RNA Structure 3.21 was used to
visualize the structure.--

Please amend the paragraph beginning at page 41, line 12 of the specification to read as
follows:

--Ornithine decarboxylase (ODC) is the first enzyme in the polyamine biosynthetic
pathway. Studies have shown the existence of translational regulatory elements in both the 5' and 3'
untranslated regions (Grens et al., J. Biol. Chem., 1990, 265, 11810). Secondary structures have
been proposed to exist in both these regions, though there is no conclusive evidence for it. The
methods described herein identified two structures in the 3' UTR, as shown below. The presence
of one of these structures (see, Figure 15) was verified using mass spectrometry probing (Griffey,
et al., Proc. SPIE-Int. Soc. Opt. Eng., 2985 (Ultrasensitive Biochemical Diagnostics II): 82-86,
which is incorporated herein by reference in its entirety). Two representative sequences that
showed slight variation in their lengths were made into RNA and subjected to MS structure
probing. Results shown in Figure 15 confirm the presence of a stem-loop structure. Accordingly,
a novel secondary structure can be identified by the methods described herein,
and its existence has been independently verified by structure probing.--

Please amend the paragraph beginning at page 41, line 25 of the specification to read as 
follows: 

--Phylogenetic tree outputs for all Ornithine Decarboxylase orthologs in Hovergen
database were obtained. Each of these orthologs was saved in GenBank format and grouped
together in a single data file. Untranslated regions in both the 5' and 3' flanks of the coding
region were extracted and compared using SEALS and COWX as described earlier (see, Figures
11 and 13).--



Please amend the paragraph beginning at page 42, line 1 of the specification to read as
follows:

--Following extraction and comparison by SEALS and COWX, Align Hits was used to
determine potentially interesting regions. Two such regions appeared, and were used for
subsequent analyses. Following extraction of sequence information from the first region,
CLUSTAL W (1.74) was used to provide the multiple sequence alignment shown. Each of the
putative hit sequences was analyzed for the ability to form internal structure in a reverse
complement matrix. This was accomplished by analyzing each sequence in a matrix where the
sequence was plotted 5' to 3' on the X axis and its complement was plotted 5' to 3' on the Y axis.
Base-pairs along the diagonals indicate potential self-complementary regions that can form
secondary structures. A domes view of the potential stem formation between base pairs in region 1,
given above the sequence alignment, was determined using RevComp. RNA Structure 3.2 was
used to visualize the structure.--



Please amend the paragraph beginning at page 42, line 13 of the specification to read as
follows:

--Mass spectrometry analysis techniques were used to probe for structure. The cluster
alignment of the first region of ornithine decarboxylase 3' UTR showed presence of gaps/inserts
in the multiple alignment. Two representative RNAs (gi404561 and gi35135) from the
alignments were used for this experiment. Analysis of the pattern of induced fragmentation
showed a very strong likelihood for base-pairing along the top half of the stem-loop structure.
This corresponds to bases 11-14 and 20-23 in 404561 or bases 8-11 and 18-21 in 35135. Bulged
bases (G9 in 404561 or U22 in 35135) also showed a characteristic fragmentation pattern. The
bottom half of the structure appeared to be less stable, and showed some fragmentation where
our analyses had predicted base-pairing. This was particularly true in the sequence 35135. This
region, however, has several contiguous A-U or G-U base-pairs which tend to be less stable, and
therefore have a higher probability of fragmentation.--

Please amend the paragraph beginning at page 42, line 24 of the specification to read as
follows:

--Following extraction of sequence information from Align Hits for the second region,
CLUSTAL W was used to provide multiple sequence alignment. Potential stem formation 
between base pairs in the second region was given above the sequence alignment in a dome 
format. Following conversion of the dome format file to a ct file, RNA Structure 3.21 was used 
to visualize the structure for the second region.-- 



Please amend the paragraph beginning at page 43, line 2 of the specification to read as
follows:

--A representative phylogenetic tree output for all IL-2 orthologs in Hovergen database
was obtained. Each of these orthologs was saved in GenBank format and grouped together in a
single data file. Untranslated regions in both the 5' and 3' flanks of the coding region were
extracted and compared using SEALS and COWX as described earlier (see, Figures 11 and 13).--

Please amend the paragraph beginning at page 43, line 7 of the specification to read as
follows:

--Following extraction and comparison by SEALS and COWX, Align Hits was used to
determine potentially interesting regions in the 3' UTR region. Two such regions appeared, and were
used for subsequent analyses. Following extraction of sequence information from Align Hits for 
the first region, CLUSTAL W (1.74) was used to provide multiple sequence alignment. Domes 
view of the potential stem formation between base pairs in the first region was given above the 
sequence alignment using RevComp. RNA Structure 3.2 was used to visualize the structure. 
Following extraction of sequence information from Align Hits for the second region, CLUSTAL 
W (1.74) was used to provide multiple sequence alignment. Potential stem formation between 
base pairs in the second region was given above the sequence alignment in a dome format. 
Following conversion of the dome format file to a ct file, RNA Structure 3.21 was used to 
visualize the structure for the second region.--

Please amend the paragraph beginning at page 43, line 20 of the specification to read as 
follows: 

--In addition to the two regions described above, a third region, downstream of, and
partially overlapping the second region, was identified using an alternate reference sequence
(3087784.fa). Following extraction of sequence information from Align Hits for this region,
CLUSTAL W (1.74) was used to provide multiple sequence alignment. Potential stem formation
between base pairs in the third region was shown above the sequence alignment in a dome
format. Following conversion of the dome format file to a ct file, RNA Structure 3.21 was used
to visualize the structure for the third region.--



Please amend the paragraph beginning at page 43, line 29 of the specification to read as
follows:

--Representative phylogenetic tree output for all IL-4 orthologs in Hovergen database was
obtained. Each of these orthologs was saved in GenBank format and grouped together in a single
data file. Untranslated regions in both the 5' and 3' flanks of the coding region were extracted
and compared using SEALS and COWX as described earlier (see, Figures 11 and 13).--



Please amend the paragraph beginning at page 44, line 4 of the specification to read as
follows:

--Following extraction and comparison by SEALS and COWX, Align Hits was used to
determine potentially interesting regions in the 5'UTR region. Following extraction of sequence
information from Align Hits for the above region, CLUSTAL W (1.74) was used to provide
multiple sequence alignment. Domes view of the potential stem formation between base pairs in
the region was given above the sequence alignment using RevComp. RNA Structure 3.2 was
used to visualize the structure.--

Please amend the paragraph beginning at page 44, line 11 of the specification to read as
follows:

--Align Hits was used to view hits in the 3'UTR region of IL-4. Following extraction of
sequence information from Align Hits for the 3' UTR region, CLUSTAL W (1.74) was used to 
provide multiple sequence alignment. Potential stem formation between base pairs in the second 
region was given above the sequence alignment in a dome format. Following conversion of the 
dome format file to a ct file, RNA Structure 3.21 was used to visualize the structure for the 
second region.-- 



In the Claims: 

Please amend claims 35, 43 and 51 to read as follows. 

35. (Twice Amended) An oligonucleotide comprising a molecular interaction site that is present 
in the RNA of a selected organism and in the RNA of at least one additional organism, wherein said 
molecular interaction site serves as a binding site for at least one molecule that when bound to said 
molecular interaction site modulates the expression of said RNA in said selected organism, wherein 